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ABSTRACT 

Starting from the Fisher matrix for counts in cells, I derive the full Fisher matrix 
for surveys of multiple tracers of large-scale structure. The key step is the "classical 
approximation", which allows to write the inverse of the covariance of the galaxy counts 
in terms of the naive matrix inverse of the covariance in a mixed position-space and 
Fourier-space basis. I then compute the Fisher matrix for the power spectrum in bins 
of the three-dimensional wavenumber k; the Fisher matrix for functions of position x 
(or redshift z) such as the linear bias of the tracers and/or the growth function; and the 
cross-terms of the Fisher matrix that expresses the correlations between estimations of 
the power spectrum and estimations of the bias. When the bias and growth function 
are fully specified, and the Fourier-space bins are large enough that the covariance 
between them can be neglected, the Fisher matrix for the power spectrum reduces to 
the widely used result that was first derived by Feldman, Kaiser and Peacock (1994). 
Assuming isotropy, a fully analytical calculation of the Fisher matrix in the classical 
approximation can be performed in the case of a constant-density, volume-limited 
survey. 

Key words: cosmology: theory - large-scale structure of the Universe 



1 INTRODUCTION 



With the growing relevance, cost and complexity of galaxy 
survey s [York et al.| p000|; |Cole et al. | p005|); |Abbott 



et al. 



( 2005[); |Scovilie et al.| (|2007[); |Adelman-McCarthy 



etal. (2008a|b|);|PAN-STARRS||Bemtez et al. (20091; BOSS 



Abell et al.| ( |2009[ ); |SUMIREHBlake et aL| ( |201l7 ]7the scien- 
tific potential of these probes must be accurately projected. 
Since that potential is usually expressed in terms of con- 
straints on the currently favored theoretical models and their 
parameters, forecasting those constraints is a critical part of 



the design and justification of any new survey |Albrecht et al. 
(2009])]. 



One of the most interesting functions that one wishes 
to constrain with these galaxy surveys is the power spec- 
trum P(k), as well as its sub-products such as the baryon 
acoustic oscillations [Eisenstein et aT (19991; Blake & 



Glazebrook (20031; Seo & Eisenstein 



ories about 
correct 



1 2003[ ) j. If our the- 
the origin of structure in the Universe are 
Mukhanov| (|2005|l; |Peter fc Uzan| (|2009[)], the 



power spectrum should be given by the expectation value 
(8(k)S*(k')) = {2-K) 3 P(k)5 D (k - k'), where S(k) is (in lin- 
ear perturbation theory) a nearly scale-invariant Gaussian 
random field corresponding to the Fourier transform of the 
density fluctuation contrast, 8(x) — Sp(x)/p(x), and Sd is 



the Dirac delta function (in this paper position space is al- 
ways expressed in terms of comoving coordinates x). 

However, in the absence of information about gravita- 
tional lensing, which can probe directly the total masses of 
halos, all that we are able to measure with galaxy surveys 
are the positions of individual galaxies and other (biased) 
tracers of the underlying large-scale structure, over a finite 
volume and with a limited spatial accuracy. Hence, all es- 
timates for the power spectrum are based on the two-point 
correlation functions of these tracers, which are themselves 
proportional to the Fourier transform of the power spectrum 
- the constants of proportionality being the bias of each type 
of tracer. 

There are many theoretical and practical problems in- 
volved in the estimation of a Fourier space function, P(k), 
from (imperfect) measurements of its counterpart in position 



space, the two-point correlation function £(x) [see Bernstein 



(1994) for the issues that arise when trying to estimate £ 
directly from the data)]. First, the mechanism whereby one 
or more galaxies appear at the peak of a local density field 
leads to shot noise - i.e., statistical fluctuations typical of 
point processes, usually assumed to be of a Poisson nature. 
Second, the finite volume mapped in a real survey leads to 
sample (or cosmic) variance, which limits the accuracy with 
which we can estimate the power spectrum for any given 
mode k - or, equivalently, £ at a scale x. These concerns 
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imply that, when building an estimator for the power spec- 
trum, one must weigh each galaxy pair in such a way to min- 
imize the variance of that estimator, while ensuring that it 
remains unbiased. Although the present work aims to estab- 
lish a more firm basis for the basic tools used for estimation 
and forecast of parameters from galaxy surveys, it is impor- 
tant to recognize that in practice the "real world" problems 
can be even harder to tackle - for reviews, see IHamilton] 
( 2005a|b l. 



of the (fiduci al) power spectrum in the usual Vogeley & 
Szalay| |l996[)] way: Fjj = 1/2 Tr [ C" 1 C,i C" 1 C A . It was 



In a seminal paper Feldman, Kaiser and Peacock IFeld- 



then shown |Hamilt on (1997a); Tcgmar ket al.| ( |1998j )] that, 
under some assumptions and after some approximations, 
the Fisher matrix reduces to the FKP formula for the (in- 
verse) variance of the power spectrum. This important re- 
sult provides the connection between forecasts in a best- 
case scenario (which, because of the Cramer-Rao bound, 
are given by the Fisher matrix), and the estimation of the 
power spectrum from real data - e.g., the Fisher matrix- 



man et al.| fiMQ] (hereafter, FKP) showed that there is based quadratic methods |Tegmark et aL] ( |1998[ )] and/or di- 



an "optimal" estimator of the power spectrum, in the sense 
that the combined contributions from shot noise and sam- 
ple variance to the covariance of that estimator are mini- 
mized. That estimator takes the form of a weighting func- 
tion for pairs of galaxies which depends on the Fourier 
mode of the spectrum that is being estimated, U (x, k) — 
n(x)b 2 (x)D 2 (x)P(k)/[l + n(x)b 2 (x)D 2 (x)P(k)], where n is 
the average volumetric density of galaxies (sometimes also 
referred to in the literature, somewhat confusingly, as the 
selection function); b is the bias of those galaxies (which we 
will assume to be linear and deterministic); and D(x) is the 
linear matter growth function at the redshift z(x). By fac- 
toring out the linear growth function (which is normalized 
to D = 1 at z = 0) , I am implicitly taking P(k) to mean 
the linear matter power spectrum, normalized at z = 0. 
Both the growth function and the power spectrum depend 
on a number of fundamental cosmological parameters (h, 
as, w, 7, etc.), which is what we ultimately would 
like to constrain. 

In a nutshell, the FKP result implies that the contri- 
bution from galaxies inside the volume dV x to the vari- 
ance of the power spectrum estimated over some Fourier- 
space volume dVk = 4?vk 2 dk / '(2ty) 3 (i.e., the bandpower 
at dVk), is given approximately by <xf> = (AP/P) 2 — 
2 / (U 2 dVk dV x ) . A related quantity of interest is the effec- 
tive volume (Tegmark (19971] for the mode k, defined as 



Veff(fc) = / dV x U' z . These results were later generalized to 
include the case where different tracers of large-scale struc- 
ture (i.e., tracers with different biases) are used jointly to 



constrain the power spectrum Percival et al. (20031; White 



et al.| | j2008) >; |McDonald fc Seljak| ( |2008| )j. 

The FKP formula has an intuitive interpretation in 
terms of functions of phase space. The density of information 
contained in a phase space cell centered on (A:, x) is deter- 
mined by two densities: the effective biased density of galax- 
ies, N = nb 2 D 2 , which (under the assumption of linear and 
deterministic bias) is a function of position x, and the lin- 
ear spectrum P{k), which is the density of modes in Fourier 
space. The adimensional function \U 2 = § [NP/(1 + NP)] 2 
can then be interpreted as some kind of density of informa- 
tion in phase space. 

In a series of elegant papers, A. Hamilton, M. Tegmark 
and collaborators IHamiltonl (|1997a|bl); ITegmarkl (|1997|); 



Hamilton ( 1997c I; Tegmark et al. ( 19981] showed that the 



FKP formulas can be derived from the Fisher matrix of 
galaxy counts in cells (i.e., when the spatial cells are the 
pixels), under the assumption of gaussianity. In that case 
the Fisher matrix for the power spectrum can be written in 
terms of the pixel-space covariance C(8 X , 5 X /) = (S X S X ) and 
its derivatives with respect to the bandpowers Pi — P(ki) 



mensional reduction methods employing pseudo-Karhune- 
Loeve (pKL) eigenmodes (which become, in effect, the pix- 
els). These methods have been extensively employed in the 
analysis of the power spectrum in the SDSS, and are re- 



viewed by Tegmark et al. ( 2004a 20061 



It is well known, however, that the FKP effective volume 
suffers from some limitations. 

First, the FKP Fisher matrix associated with V e //(fc) is 
purely diagonal, FpKp(k,k') ~ Veff(k) <5e g,, which means 
that it neglects the covariance between the estimates of the 
bandpowers P(k) and P(k'). This not only overstates the 
constraining power of the Fisher matrix, but it also does 
not allow us to estimate the optimal size of the bins or, 
equivalently, to compute the principal components of the 
full matrix. Knowledge of the full Fisher matrix would be 
useful to improve the forecasts of constraints on cosmological 
parameters, to obtain the minimal size of the k bins, and 
even to inform the choice of pKL modes. 

Second, the effective volume does not take proper ac- 
count of long-range correlations. In order to better appreci- 
ate this deficiency, consider the pathological case of a galaxy 
catalog that is formed by two disjointed volumes, Vi and 
V2. For simplicity, assume that the average effective number 
density of galaxies is the same (No) in both volumes. The 
FKP formula then tells us that the variance of the power 
spectrum at the scale k that can be estimated with that 
catalog is: 



2 (k) 



1 dV k 
2 



dV k 



U 2 dV x 



N P(k) 



(1) 



N P(k) 



x (V1 + V2) 



We recognize this as the sum of the diagonal terms of the 
Fisher matrices for the galaxies in Vi and that for the galax- 
ies in V2. But there is no cross-correlation term, meaning 
that the information residing in the correlation between any 
galaxy in V\ and any other in V2 has been somehow neglected 
in that approximation Q 

These problems arise out of what Hamilton has called 
the "classical approximation" Hamilton (1997a c)], whereby 
only galaxies in the same shell in position space, and only 
power spectrum estimates in the same shell in Fourier space, 
are allowed to have non-vanishing correlations. The term 
"classical limit" is inherited from the language of quantum 
mechanics, and that toolbox turns out to be useful in the 
context of the statistics of galaxy surveys Q The language 



1 I would like to thank Ravi Sheth for bringing this puzzle to my 
attention. 

2 This paper is heavily indebted to ideas and notation set forth 
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and concepts of quantum mechanics are convenient in this 
context because some objects of interest can be diagonalized 
in one basis, but not the other: e.g., the linear power spec- 
trum is diagonal (at least in standard linear theory) only in 
the Fourier basis, while the shot noise term in the covariance 
of galaxy counts is diagonal only in the position-space basis 
- hence, in that sense, these two operators do not commute. 
The covariance of galaxy counts, however, is not diagonal 
in either one of these basis, which is the main complicating 
factor. 

The FKP result follows from making two distinct ap- 
proximations in the Fisher matrix of galaxy surveys: i) the 
first step (the "classical approximation") is to take all op- 
erators (such as the correlation function or shot noise) to 
be classical, and therefore commuting with each other; ii) 
the second step consists in taking the limit whereby the 
phase space window functions exp \i(k — k') ■ (x — af)j — > 

x — x"). This limit is basically a station- 
ary phase (SP) approximation. 

This paper shows how to obtain an exact, but formal, 
expression for the full Fisher matrix of galaxy surveys. It 
also shows hot compute the full Fisher matrix in the classi- 
cal approximation - but without having to make use of the 
SP approximation. The calculation of the Fisher matrix in 
the classical limit relies on the use of a phase-space basis (in 
both position space and Fourier space), which allows the in- 
version of the pixel-space covariance in that approximation. 
I compute not only the Fisher matrix for the estimation of 
the power spectrum on bins of the Fourier modes k, but also 
the Fisher matrix for the estimation of functions of redshift 
(such as the bias of each tracer and/or the growth function), 
as well as the cross terms of the Fisher matrix which express 
the correlation between estimations of the power spectrum 
as a function of k and estimations of bias (and/or growth 
function) as a function of z. Readers uninterested in the 
details of the calculation can skip to the end of Section [3j 
where the main results of this paper are summarized by Eqs. 
f40l-(|45l, as well as their classical limits, Eqs. (|46|)-(|48]). 



This paper is organized as follows. In Section [2] the 
Fisher matrix for an arbitrary survey of multiple types of 
tracers of large-scale structure is derived from the covari- 
ance of galaxy counts using standard notation. In Section 
[3] I examine the Fisher matrix from the perspective of ob- 
jects borrowed from quantum mechanics - operators, basis 
vectors, states, etc. Starting from the covariance matrix in 
phase space and its nai've inverse, I derive the full Fisher 
matrix for galaxy surveys, and show that it has all the right 
properties - including the fact that, of course, it reduces 
to the FKP formula after taking both the classical and the 
SP approximations. Still in Section [3j I compute the Fisher 
matrix for the bias and/or growth function, as well as the 
terms of the full Fisher matrix which mix the estimation of 



in A. Hamilton's papers [Hamilton &c Culhane| |l996| l; |Hamilton| 
( |1997a|b|c| |], who was (to my knowledge) the first to make exten- 
sive use of the analogy between stochasticity in galaxy surveys 
and the language of quantum mechanics, e.g., treating the two- 
point correlation function and the power spectrum as a single 
"operator" expressed in two different basis (position- and Fourier- 
space, respectively). 



the power spectrum with the estimation of position-space 
functions from the same galaxy survey. Finally, in Section [4] 
I consider, as an application, an isotropic survey with effec- 
tive number density N(x) and an isotropic power spectrum 
P(k). When N(x) is given by a top-hat profile, there exists 
an analytical solution for the Fisher matrix in the classi- 
cal limit (but without having to assume the SP approxima- 
tion). That analytical solution shows that cross-correlations 
between different bandpowers can arise if the Fourier-space 
bins are too small - a problem that may affect some recent 
analyses of large-scale structure Percival et al. ( 20101]. In 



that Section I also derive an analytical expression for the 
Fisher matrix which measures the information contained in 
the cross-correlation between two species of tracers with top- 
hat density profiles. In particular, I show that in this exam- 
ple the Fisher matrix can be expressed in terms of phase 
space window functions which are basically identical to the 
Fisher matrix that follows from expressions found in Hamil- 
( 1997a|b' k I present, in an Appendix, a semi-analytical 



ton 



formula for the Fisher matrix in the case of a galaxy sur- 
vey with an arbitrary distribution of any number of different 
tracers of large-scale structure. 

For this purposes of this paper I will only work in posi- 
tion (real) space, but the generalization to redshift space is 
straightforward: the power spectrum, in particular, inherits 
the redshift distortions and the associated dependence on 
the direction of the modes, P(k) — > P"(k±,k^). I also do 
not fully explore the Fisher matrix for position-dependent 
degrees of freedom such as the bias and growth function - 
that will be the subject of a forthcoming paper. 



2 THE COVARIANCE MATRIX OF GALAXY 
COUNTS AND THE CLASSICAL 
APPROXIMATION 

The basic object in the construction of the Fisher matrix is 
the covariance matrix for galaxy counts. I will consider many 
different species of tracers (e.g., red galaxies |Tegmark et al 



(2004b 



et al. 



1997 1], blue galaxies [Norberg et aL]p0 02); Tcg mark 
2004c)], emission-line galaxies [Blake et al.| ( |2011|)], 



neutral H regions probed by quasar absorption lin es [Seljak 



et al.| ( 2005a |], quasars |Sawangwit et al. (20111; Abramo 
et al.| ( 2011 1], etc.), with mean number densities and linear 



biases given by n M (a?) and bp(x), respectively, where greek 
indices /i = 1 . . . N t denote the different types of tracers. 
The two-point correlation function between the counts of 
any two types of tracers is given by: 



= b„{<g)D{x)b„($)D{$)t{x 
B^(x)B v (y) 1 



y) 



(2) 



(2tt) 3 

where we have included the matter growth function D(x) — 
D(z[x]) into the definition of an effective bias B^ = b^D 
(which is assumed linear and deterministic), and P(k) is 
the linear matter power spectrum. Because the power spec- 
trum is the Fourier transform of the real function £, it obeys 
P*(k) — P(—k). If isotropy holds, the spectrum is a (real) 
function of k = \k\, but I will not assume this until we 
come to Section IV. Notice that (,fj, v (x,y) = £,viJ.{y,x), but 
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Finding and mapping individual objects is basically a 
point process, subject therefore to stochasticity (shot noise, 
in this context). Cosmologists usually make the simplest pos- 
sible assumption and take a Poisson distribution for the shot 
noise of counts in cells - although this may be an overesti- 
mate, particularly if one counts halos instead of galaxies ICail 



et al. (2011 1). In that case, the covariance of galaxy counts 
can be expressed as: 



d 3 k 



ik-(x — x ) 



5 D (x — x) 



(3) 



(2tt) 3 

B„(x)B„(x')P(k) + 



\Ai M (f)n„(z') 



where the Kronecker delta expresses the absence of shot 
noise when cross-correlating different types of objects in the 
same cell. 

If a set of observables Qi obey a Gaussian distribution 
with zero mean, then we can immediately write their Fisher 
information matrix with respect to a galaxy survey |Vogeley| 
fc Szalay|fl996[);|Tegmark et al.| (|1997[)]: 



F 



-Tr 



aiogC dlogC 



= rtr 



f.tua/3 



dQj 
~dQ~ 



(4) 



a 



a/3 



dQj 



where I denote the trace over position- and Fourier-space 
variables (i.e., the integrals over those variables) with the 
lower case. For the sake of clarity, in this Section I will leave 
the integrals over real space and Fourier space explicit. 

Suppose that we wish to estimate the bandpowers of the 
power spectrum, i.e., the amplitudes P(ki) on bins ki [for 
reasons of dimensionality, it is more convenient to estimate 
logP(fci)]. The derivatives of the covariance matrix with re- 
spect to the bandpowers are the functional derivatives: 

B^)B v {y) 



SC^(x,y) 



Slog P(ki) 



d s k 



ik-(g-y) 



(2tt) 3 



5P(k) 



SP(ki) 
Bplft Buffi e 



(5) 



ifcj - (x — y) 



P(h 



where I have used the fact that, with the conventions used 
in this paper, Sf(k)/Sf(k') = (2irf5 D (k - k'). The Fisher 
matrix for the power spectrum can then be written as: 



F P (ki,kj 



-ik^-(x —y)—ikj-(y —x) 



~ / d 3 x d 3 x d 3 y d 3 y e 



xC^(y,y')Bp(y')P(k 3 )B»(x) . (6) 

The main problem with Eqs. Q or |6| is that, using 
position-space pixels, it is not feasible to invert the covari- 
ance matrix C M „ due to its enormous size. The formal ex- 
pression for the covariance does not take us far either, since 
we would then need to solve a system of integral equations: 

d 3 x' ^2 C~l{x, x) C ua (x, x") = S^c S D (x - so") . (7) 



Depending on the concrete case, different approxima- 
tion schemes can be used to invert the covariance matrix. 
One such technique consists in consolidating the the spatial 
pixels into a much smaller set of pKL eigenfunctions, which 
drastically reduces the dimensionality of the covariance ma- 
trix |Tegmark et~aT1 ( [1998] |2004a| |2TJ0"6| |]. In this case, the 
particular choice of pKL decomposition is justified a poste- 
riori, in the sense that the particular choice of compression 
is shown to be nearly lossless. 

A different scheme that has been used to invert the 
covariance matrix is to try an approximate solution to the 
formal expression, Eq. (|3| - see, e.g., Hamilton ( 1997a). The 
approximation follows from the fact that the average volu- 
metric density n M and the bias _B M = b^D vary slowly as a 
function of position, compared with the exponentials e lk ' x 
in Eq. (J3j) . This means that we can take the integrand of Eq. 
(J3j and use it to generate an approximate inverse covariance: 

d 3 k 



— ik-(x — x ) 

(2tt) 3 6 
B„(x)B v {x)P{k) + 



(8) 



If we now take the inverse of the expression inside the brack- 
ets above to mean the naive matrix inverse of the Nt x Nt 
square matrix, we obtain: 



C, 



(2tt)3 



5uv 



1 + N(x,x')P(k) 



and N(x,x") = V 4> a {x)(j) a (x). Here 



where • 

(fjfj, plays the role of a shot noise-corrected effective bias for 
the species /i, in the sense that the clustering of two species 
H and v as a function of position (and, therefore, as a func- 
tion of redshift), normalized by the underlying matter power 
spectrum, has a signal-to-noise ratio proportional to <j>p<l>v 
- and this definition should not be confused with the effec- 
tive bias as defined in, e.g., Tegmark et al. (2004b I. Substi- 



tuting the ansatz of Eq. |9l into Eq. ([7j) it can be verified 
that the corrections are small when tj)^ are smooth func- 
tions of the spatial coordinates - in fact, the approximate 
solution of Eq. Q should be regarded as the first term of 
a perturbative series, where the higher-order terms can be 
obtained by iteration starting with the lowest-order solution 



[Hamilton ( 1997a l). We will show in the next section that 
this expression in fact follows from the use of the classical 
approximation. 

Substituting our ansatz, Eq. j9j, back into Eq. (TsJ) , we 
obtain: 

1 r j3 „* d 3 k d 3 k' 



Fp.a « -P(h)P(k 3 ) J d 3 xd 3 x'd 3 yd 3 y' — — (10) 

x 3?) — ikj-(y'—x) — ik- (x-x 1 ) -ik' -(y-y 1 ) 

N(x,x) + P{k) [N 2 (x,x') - N{x,x)N(x',x)] 

l + N(x,x')P(k) 
N(y, gO + P(k') [N 2 (y, y 1 ) - N(y, y)N(y', f)] 

l + N(y,y>)P(k>) 

Integration over k and k! will select only the positions such 
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that x! ~ x and y ' « y, respectively (this is the first instance 
where we need the SP approximation). Hence, if in Eq. ( 10 I 
we make the substitutions N(x,x") — > N(x,x), N(x",x") — > 
N(x,x), etc., the terms inside square brackets cancel, so 
after integrating over a? and y' we obtain: 



Fj 



P,ij 



j3 ?3 i(kj —ki) -(x — y) 

a x a y e y 3> y 1 



(ii) 



x N(x)P{kj) N(y)P(k 3 ) 
l + N(x)P(ki) l + N(y)P(kj) ' 

Here N(x) = N(x, x) = n M B% plays the role of a total 
biased effective number density of tracers. The integrand of 
Eq. (Ill is basically the FKP pair window 
bandpowers P(ki) and P(kj 
wavenumber, ki = 



if we take the 
to be evaluate d at the same 
( 1997a] . This 



Hamilton 



tj - see, e.g 

equation also shows that (within our approximations) the 
best possible estimator for the power spectrum has a co- 
variance which is given by the inverse of the Fisher matrix 
above. In fact, the FKP method corresponds to weighting 
pairs by the inverse of the variance of the power spectrum - 
i.e., the weights are the diagonal elements of the Fisher ma- 
trix. An even better estimator is provided by the quadratic 



method Tcgmark et al. (1998 2004b |], which employs the 



full Fisher matrix - and therefore takes into account the 
correlation between estimates of the power spectrum at dif- 
ferent scales. 



The advantage of Eq. (Ill is that it splits the problem 



of computing the Fisher matrix into two separate Fourier 
integrals - in contrast to the usual approach |Hamilto"n 
1 1997a|b l]. Nevertheless, I will show below that, at least for 
the simple case of a survey with a top-hat effective number 
density, N(x) = Nq8(xo — x), Eq. ( 11 1 reduces to an expres- 
sion very similar to that which can be obtained directly from 
the classical limit of the Fisher matrix |Hamilton ( 1997a|b' l], 
In Section [3] I will derive Eq. (Ill using the language and 
tools of quantum mechanics, and in Sectional will show how 
to compute the Fisher matrix in terms of semi-analytical ex- 
pressions, in the case of an isotropic distribution of galaxies 
and an isotropic power spectrum. 

Either by direct computation or by induction from Eq. 
(Ill, one can easily write the contributions to the Fisher 



matrix for the bandpowers of the power spectrum that come 
from each one of the individual tracers, as well as from their 
cross-correlations : 



N^x)P(k t ) N v (y)P{ki 



(12) 



1 + N(x) P(ki) l + N(y)P{k 3 ) 

where JV M = 4>fi is the effective biased number density of the 
tracer species /i. This expression generalizes the results of 



Percival et al.| ( |2003| ); [White et al.| ( |2008| ); |McDonald fc Seh 
jak| ( |2008| ), which were found only in the classical limit and in 
the SP approximation. It may be useful to regard the prod- 
uct JV M P — > P M as the power spectrum of each individual 
species of tracer, and N P = ^2 P M as the total spectrum 



indeed, that was the notation used in White et al. (2008) 



The Fisher matrices above are additive, as they should be, 
with the total Fisher matrix, Eq. (Ill, being given by the 



sum over all species, F P jj = ^ F F\ij- 



Eqs. ( 11 1-| 12 1 have an intuitive interpretation in terms 



of the interference between the information in the phase 
space cell (x, ki) and the information in the cell (y, kj). The 
phase difference e~ l< - ki ~ kj ^' l - x ~^ can be regarded as a phase 
space window function, since it creates a constructive or de- 
structive interference between the cells which oscillates very 
rapidly if either ki 7^ kj or if x 7^ y. If a pair of galaxies 
occupies the same or a nearby spatial cell, x ~ y, then there 
is no phase difference and the contribution to the Fisher ma- 
trix is basically flat, with the only sensitivity to the power 
spectrum P(k) coming from its scale dependence - and in 
fact, that is the information which is encoded in the effective 
volume V e ff (k). The phase space window function also takes 
into account the fact that galaxies separated by a wavelength 
A contribute most to those wavenumbers which are sepa- 
rated by the harmonics of that wavelength, Ak n • Ax = 2nn 
(n = 0, 1,2, . . .) - but that contribution falls off with n, on 
account of the faster oscillations of the window function. 

The FKP result can now be obtained by taking the 
functions in phase space to be simultaneously localized in 
position and in Fourier space. A straightforward way of im- 
plementing this approximation is to notice that e tAk ' Ax [ s 
in fact a window function in phase space which is already 
normalized to unity over the volume of phase space, so a 
stationary phase (SP) approximation naturally leads to the 
substitution: 



(2n) 3 S D (Ak)S D (Ax) 



(13) 



In the SP limit the Fisher matrix for the power spectrum 
reduces to: 



N(x) P(kj) 
1 + N(x) P(h) 



(14) 

which is the familiar result. For the individual species of 
tracers, the expression is: 



p, ij 



^(27v) 3 5 D (k t 



d3x N„(x)N„(x)P< 



1 + 



N(S)P(ki)j' 



(15) 



The integran d in Eq. (|15[) is essentially the result of |Percival"| 
et al.N2003l;|White et all (120081). It is possible to derive this 



result also by minimizing the variance of the multi-tracer es- 
timator of the power spectrum, as in |Percival et al.] |2003), 
or by directly computing the covariance of the power spec- 
tra between the tracers, as in White et al. (20081. Here I 
obtained this result from the Fisher matrix in pixel space, 
which is a direct check that the generalization of the FKP 
pair weighting to many types of tracers in fact corresponds 
to the least-variance estimator. 

There are, however, important differences between the 



more general result, Eq. ( 12 1, and its SP limit, Eq. ( 15 1: first, 



the off-diagonal elements of the Fisher matrix are present in 
the general expression, but not in the SP limit; and second, 
the manner in which large-scale correlations are manifested 
in the Fisher matrix. To appreciate this difference, consider 
again the pathological case already discussed in the Intro- 
duction, which I will restate here in more generality. Sup- 
pose that the tracer species 1 has nonzero density at some 
position x\ but it vanishes at the position X2; likewise, the 
tracer species 2 has nonzero density at X2 but vanishes at x\ . 
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In such a scenario, the integrand of Eq. ( 12 I for the cross- 



correlation term F P ij is nonzero when evaluated at those 



two points; however, in the SP approximation, Eq. (15 1, the 
integrand vanishes both at x = x\ and at x = x%. Hence, 
in order to fully retain the correlations at different spatial 
points, we need to keep the distinction between the different 
shells in phase space, which is exactly what the expressions 



of Eqs. |lll-|12l do 



3 FISHER MATRIX IN THE LANGUAGE OF 
QUANTUM MECHANICS 

I will now employ some tools borrowed from quantum me- 
chanics in order to derive a few useful results, in partic- 
ular the Fisher matrix for the power spectrum P(k), the 
Fisher matrix for the effective shot noise-corrected bias, 
4>y.(x*) = yfri^B^, and the terms of the full Fisher matrix 
which correlate the estimations of P(k) and <j>p(x) when we 
try to estimate both from the same dataset. This is by no 
means the only way to obtain the full Fisher matrix, but is 
perhaps the briefest. 

As first pointed out by Hamilton Hamilton ( 1997a|c l], 
the stochastic variables involved in calculations of the Fisher 
matrix can be expressed as operators, while position space 
and Fourier space are simply two different bases in Hilbert 
space. The normalization of the basis vectors (the Dirac 
bra's and ket's), as well as the relationship between the 
position-space basis and the Fourier basis, are the usual 
ones: 



(x\x) 
(k\k') 
(x\k) 



Sd(x — x ) , 



(16) 
(17) 
(18) 



In that way, the two-point correlation function and the 
power spectrum correspond to the same operator expressed 
in two different basis, = £(x — x') and {k\£\k') — 

(2n) 3 P(k)dD(k — k'). By selecting a discrete basis (bins in 
Fourier space or bins in position space), operators take the 
form of matrices. 

For simplicity, in this paper I will assume that the av- 
erage densities n M are directly measured (not estimated), 
which means that they commute with the derivatives with 
respect to any parameters of interest, dn^/dQi — 0. In that 
case, the trace over all indices that defines the Fisher ma- 
trix, Fij = (l/2)Tr[C*" 1 C, 4 C" 1 C, j ], cancels all the factors 
of the average densities that appear outside of the effec- 
tive biases M = y'n^B^. This means that we can pull the 
number densities out of the covariance matrix, and write 
the Fisher matrix in terms of a renormalized covariance 

where: 



F, = (l/2)Tr[C- 1 C, I C- 1 C 



C^(x, y) = <5 M „ + (j}^{x) £(x - y) <f>„(y) 



(19) 



Let us then define a renormalized covariance operator as: 



C u 



+ 



t>t v ] pi 



(20) 



where the effective bias operators <^ M = cfy are diagonalized 
in the position-space basis, M |af) = cf>^(x)\x), and the spec- 
trum operator p = p* is diagonalized in the Fourier-space ba- 
sis, p\k) =p(k)\k), withp 2 = | so that f\k) = P{k)\k). The 
covariance is clearly hermitian, C\ v = CV M . Furthermore, 



taking the expectation value of the covariance operator in 
the position-space basis, {x\Cpi,\x"), leads to Eq. (|3p - up to 
the factors of the average densities, which are traced out of 



the Fisher matrix. The ordering of the operators in Eq. ( 20 1 



is in fact unique: if we had defined the renormalized covari- 



ance in any other way, e.g. CZ V 



y p, then its 



expectation value on a position basis {x\C^\x") would not 
reduce to the correct expression, Eq. pj. This is a direct 
consequence of the fact that some operators are diagonal in 
one basis, but not on the other, which means in particular 
that the operators </> M and p do not commute - otherwise all 
possible orderings of the operators in the covariance would 
be equivalent! 

In order to invert the (renormalized) covariance matrix 
operator, it is useful to define the total effective covariance 
as follows: 



c = i + £i 



At 



p f = 1 +pNp t . 



(21) 



where N = ^2 <i>ti.<\>\- In terms of the operator C, the exact 
inverse of the covariance operator is given by the formal 
expression: 



1 p4> v 



(22) 



Of course, this still leaves open the problem of inverting C, 



but Eq. ( 22 1 shows that the total effective density N which 



appears in the definition of C appears quite generically as a 
result of inverting the covariance of galaxy counts. The in- 
verse of C can be obtained either directly from the dataset, 
or formally in terms of a perturbative series around the op- 
erator pNp] - in much the same way as was proposed by 



Hamilton (1997a I. This result also shows that, in order to 
obtain the exact inverse for the covariance C^ v , all that we 
need is the exact inverse of the total effective covariance, C, 
and not the solution to a higher-dimensional linear system 
involving all pairs of every possible type of galaxy. In fact, 
the higher-dimensional linear problem has more equations 
than unknowns, and it would be singular were it not for the 
fact that it can be reduced to a single matrix inversion (that 
of C). 

It is instructive to compute the total effective covariance 
in a mixed basis: 



(k\C\x) = 



= P(k) 



+ e ' 



(k\(l +pNp t )\x) 
d s k' 



(23) 



(2tt)3 



I d 3 y{k\y){y\Np^\k'}{k'\x} 



I ^3^' S P(k)N(k-k')p*(k') 



+ e ' 



where N is the Fourier transform of the total effective den- 
sity. By invoking the classical limit, or equivalently, assum- 
ing that N(x) is a smooth function of position, this expres- 
sion can be approximated by: 



(k\C\x) w e* 2 [l + N{x)P(k)} 



(24) 
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3.1 Exact Fisher matrix 



3.2 Approximate expressions 



In order to derive the Fisher matrix we need to compute the 
derivatives dC^^/dQi, where Qi are the parameters that we 
wish to estimate. In the case where these parameters are the 
bandpowers Pi — P(ki), these derivatives can be expressed 
as the operator: 



dP, 



d 3 k d ~ t „ t 



(2tt) 3 dPi 



4>lfr \k)(k\p4> v = 4>l \ki)(ki\ cj> v 



(25) 

Using the definition of the total covariance we have that 
P ( t > v4>t'P* r = pNp^ = C — 1, and therefore: 



d log C MQ _ ~ i dC va 
dlogP, ~ M " 9 log Pi 



(26) 



= (plpC 1 \h) {ki\p 4> a 



from which we immediately obtain the Fisher matrix for the 
(log of the) bandpowers: 



(27) 



-{h\{l - C- X )\kj){ki\{l - C^ 1 )!^) 



Similarly, if we want to estimate the effective biases 
<j} a (x) from the data, the relevant partial derivatives for that 
Fisher matrix are: 



d(j) a {x) 



d 



d 3 x d 3 x" <j> ti (x)\x){x\p' 

Xp\x"){x"\ 4>v{x') 

= <W \x)(x\jy p4>l + S„ a $lp'p\x){x\ . (28) 

A calculation similar to the one performed above for the 
case of the bandpowers leads to the Fisher matrix for the 
effective bias: 



F«y(x,y) 



1 

-tr 
2 



dC Pli 

d\ogN a {£) a0 <91ogiV 7 (y) 



■S^NAx) (x\y){v\pH^ 



)p\x) 



+ - N^N^tf^C^pm^C-^x) 
-±N a (x)N^y) [(Slp+plSXi/lp^-'pla!) 
+ (S^C^pM^pW] . (29) 

The Fisher matrix for the total effective number density, 
N, can be obtained by tracing the effective biases, Fm — 

Finally, we can also compute the cross-terms of the 
Fisher matrix that mix the power spectrum estimation with 
the estimation of the effective bias: 



F Pa (k,x) 



-tr 



A-l 



dC va 



dC, 



"' A 91ogA CT (f) 



aiogP(fc) 

[{ktf^WpC-^k) 

+ (fc|C _1 J> t |x><£B|p|fe> 

- 2(fc|C~V|:r)<:E|pC~ 1 |fc)] 



(30) 



Eqs. (|27j)-(|3Gj are exact, but unless we figure out how to 
invert the total covariance C, they are not of much use. In 
order to obtain expressions that we can work with, some 
approximate expression for that inverse must be produced. 
The crucial step at this point is to use the classical approxi- 
mation, so that p commutes with <^ M , and therefore operators 
such as the inverse total covariance can be expressed as a 
power series: 



cr 1 = (i + pNfy L « l-pfN + ipf) n 2 + 



(31) 



Just as we used a mixed basis to obtain Eq. ( 24 1, the matrix 



elements of the inverse total covariance and other similar 
operators in a mixed basis can be written, in the classical 
approximation, as: 



(fc|C _1 |x) 



(fe|C _1 p|^> 



:\p C 1 p\x) 



ik-x 



1 + N(x)P(k) 
1 + N(x)P{k) 

o ik-S P(k) 

1 + N{x)P{k) 



(32) 
(33) 
(34) 



What this means is that 
{k\&- x 6\k') 



(2Trf5 D (k-k') 

d 3 x(k\C' 1 \x)(x\C\k') 

d 3 x e lt3 [l + N{x)P{k)y 1 
x e- A '- s [l + N(x)P(k')\ , 



(35) 



and a similar expression in the position-space basis. 

Using the classical approximation in the Fisher matrix 



for the power spectrum, Eq. (271, one readily obtains the 



same expression that was derived in the previous Section, 



Eq. (111. We can also obtain the Fisher matrix for the ef- 



fective biases in the classical limit, Eq. (29 1, in a similar 
fashion: 



Pcr 7 (•£, X ) 



5 CT7 8d(x — x) Na(x) (36) 

d 3 k N(x)P 2 (k) 
2tt) 3 l + N(x)P(k) 

1 at ^ir /-</\ f d 3 kd 3 k' -i(%-%').(g-j?) 
-N tT (x)N J (x)J w e 

P{k) P(fc') [2 - N(x)P(k) - N(x')P(k')] 



[1 + N{x)P(k)][l + N(x')P(k')] 



The Fisher matrix for the total effective density N can be 
written as: 



F N (x,x) 



d 3 k d 3 k' 



-i(k — k ) - (x — x ) 



8 J (2tt) 3 (2tt) 3 

^N(x)P(k) + N(x')P{k')Y 



(37) 



[1 + N(x)P(k)][l + N(x')P(k')] 
Under the classical approximation, the cross-terms of the 



Fisher matrix which mix power spectrum and bias, Eq. ( 30 1 
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become: 



3.3 Generalized FKP formulas 



F Pa (fc, X) 



lN a (x)N(x)P 2 (k) 



2 [1 + N{x) P(k)} 2 
In terms of the total effective density of tracers, we have 



(38) 



F PN (k,x) 



N(x) P(k) 



(39) 



l + N(x)P(k) 
The main results of this Section can be summarized as 



follows. In tandem with the notation of Hamilton (1997b), 



let's define the phase space functions weighting functions 
(which are nothing but the FKP weights): 



(7 M (fc,f) 
U(k, x) 



N„{x)P(k) 
1 + N(x)P(k) 

N(x)P(k) 
1 + N(x)P(k) 



(40) 



Recall that these weight functions are r elate d to the total 
covariance operator C, defined in Eqs. (21 1 and (231, by 



{x\{l -C _1 |fc) w e- ,h ' x U(k,£). With the help of this func- 
tion we can write the Fisher matrix for the power spectrum 
as: 

F P {k,k') « ^ J d 3 xd 3 x' e l[i -^ ){s - s ' ) U{k,x)U{k',x l ) 



L -(k\U\k')(k'\U\k) 



(41) 



Likewise, the Fisher matrix for the total effective number 
density N = n^B 2 , is: 



Fjsr(x, x 



1 I" d 3 k d 3 k' -»(jE-£')(£-g') 
8 J (2tt) 3 (2tt) 3 6 

x ^2U{k,x)U(k',x) 



(42) 



+ 



U 2 (k,x) 



1 - U(k,x. 
+ [l-U(k,x)\ - 



[l - U$',tf) 



U 2 (k',x') 
- U(k',x') 



The advantage of these formulas is that they make clear that 
all we need to compute are expressions such as: 



(k\U\k') = 



d 3 xe K% - % ' ys U{k,x) 



d fe — zfe'(£— x ; ) ^ (fc, x) 

(2tt) 3 6 1 - [/(fc, z) 



(43) 



(44) 



Last but not least, the cross-terms which mix power 
spectrum and bias are given by: 



F Pa (k,x) 
F PN (k,x) 



-U(k,x)U a (k,x) 

^-U 2 (k, x) = F Pa (k, x) 



(45) 



Equations 1 38 1-( 45 1 therefore express the full Fisher 



matrix in the classical approximation, and are the main re- 
sult of this paper. 



Finally, let's recover the FKP results by taking the station- 
ary phase (SP) limit of Eqs. pT| and (|42|. Recall that, 
in order to take that limit all we need to do is substitute 
e iAk-A3 _^ (2n) 3 S D (Ak)5 D (Ax). In that case we obtain: 



F P (k,k') 



(2tt) 3 S D (k - k') J d 3 xU 2 (k,x) 
(2irf5 D (k~k')V ef f(k) , 



(46) 



which is the usual result. 



As for the mixed terms of the Fisher matrix, Eqs. ( 38 \ 



(39 1 or, equivalently, Eq. (45 1, the classical limit result is 



already in the SP limit, in the sense used here. 

In the case of the Fisher matrix for the effective bias, 
the result is: 



y (x, x) — > -Sd{x — x) J 



d A k 



Ua 



(2tt) 3 

x [S ai U(l + NP) + U 1 (l 



(47) 



NP)] 



For the total effective number density, taking either the SP 



limit of Eq. (37 1, or summing over the species of tracers in 



the expression above, leads to 
1 



F N (x,x) 



Sd(x — x) 



d 3 k 

(27T) 3 

5n(x — x) V e ff(x) , 



U 2 (k,x) 



(48) 



where the last term on the right-hand side can be interpreted 
as the effective volume in Fourier space. Just as the average 
number density of galaxies, the fiducial bias and the growth 
function affect the accuracy of the estimations of the band- 
powers through V e ff (k), the fiducial power spectrum also 
affects the accuracy with which we can measure the bias 
and the growth function through V e ff (x). 

An important feature that emerges from the analysis 
above is that the estimations of the bandpowers and of the 



biases are correlated, as shown by Eq. ( 45 1. The bottom line 



is that one should not naively assume some normalization 
for the power spectrum in order to fit a model for the bias, 
and then use that bias model in order to fit the power spec- 
trum, while expecting that the errors in the power spectrum 



should still be given simply by the Fisher matrix of Eq. ( 46 1 



Because the estimates of the power spectrum are all corre- 
lated with the estimates of the bias, one should in fact esti- 
mate both jointly. The full Fisher matrix derived here allows 
this joint estimation from first principles, which is useful for 
making more accurate forecasts. The expressions above can 
also be used for a proper treatment of priors, such as an 
independent measurement of ag from either the cosmic mi- 
crowave background and/or cluster counts, or constraints on 
bias from gravitational lensing |Seljak et al. ( 2005b |]. This 
will be the subject of a forthcoming publication (Abramo 
2011, to appear). 

The results of this Section show a consistent pattern 
that can be summarized as follows. In the same way that we 
can regard the power spectrum as the density of modes [and 
in fact P(k) has dimensions of a density in Fourier space], the 
discussion above implies that we can interpret N(x) as the 
effective density of tracers in position space. The combina- 
tion \U 2 = \N 2 P 2 /(l + NP) 2 , in turn, can be regarded as 
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the density of information that can be obtained from a cata- 
log of galaxies whose biases we don't know, and whose distri- 
bution traces some underlying matter density whose power 
spectrum we also don't know. This density of information is 
naturally a phase space object, since it depends on knowl- 
edge about objects which live in Fourier space and in posi- 
tion space. Hence, the total information contained in the vol- 
ume cells AV X and AVfc is ^U 2 AV x AVk - and this is, in fact, 
the Fisher information matrix per unit of phase space vol- 
ume. The usual FKP Fisher matrix for the power spectrum, 
evaluated at the bin AVfe, is obtained simply by tracing out 
the position-space volume, Fp(k,k') — S kk iAVk f dV x \U 2 . 
The Fisher matrix for the bias, evaluated at the spatial 
bin AV X , is found by tracing out the Fourier space vol- 
ume, Fn(x,x') — 5 XiX iAV x f dVk\U 2 . And the elements 
of the Fisher information matrix that express the corre- 
lations between estimates of the power spectrum and the 
estimates of bias, evaluated at the bins AV X and AVfc, is 
F PN = \U 2 AV x AV k . 

As a curiosity, notice that the properties of \U 2 are 
very similar to those of another object of deep significance 
in quantum mechanics: the Wigner distribution function, 
which is the phase space equivalent of the density matrix 
|Wigner| ( [19321 ) ; |Peres| ( |2002| )]. Both the density matrix and 
the Wigner function can be interpreted as probability dis- 
tribution functions - with the caveat that the Wigner func- 
tion is not necessarily positive, so it is considered a "quasi- 
probability" [Peres ( 2002 )] . The Wigner function has a fun- 



damental role in the physical interpretation of quantum me- 
chanical states, since it describes how states are spread out 
in Fourier space and in position space. The Fisher infor- 
mation density \U 2 , similarly, describes how information is 
spread over phase space, and what is the probability of mea- 
suring some parameters in phase space [e.g., the bandpowers 
of P(k)\ within some interval. 



4 ANALYTICAL FISHER MATRIX FOR A 
TOP-HAT VOLUME-LIMITED SURVEY 

In this Section I will show that, when all variables are 
isotropic, P(k) — ¥ P(k), n(x) — > n(x) etc., then we can ex- 
press the Fisher matrix for the power spectrum in the classi- 
cal limit in terms of analytical formulas. The calculations in 
the case of the Fisher matrix for the effective number density 
are completely analogous, and the corresponding expressions 
can be found by exchanging the roles of position-space and 
Fourier-space in the formulas below. 

Let's start with the expression for the Fisher matrix for 



the bandpowers in the classical limit, Eq. (41 1, and assume 



that P = P(k), and N = N(x). In that case, we can average 
out the angular dependence of the Fisher matrix: 

d 2 k 

47T 



F P (k,k) = 



d 2 k' 



4tt 



F P {k,k') 



(49) 



= -Jdxx 2 J dx' x' 2 U(k, x) U(k' , x') 



d 2 K jo(Ak x) jo(Ak x) 



where hats denote unit vectors, k = k/k, Ak = \k — k'\, and 
jo(z) = sin(z)/z is the 0-th order spherical Bessel function. 
The angular integrals over k and k' can be performed by 



writing Ak = \/fc 2 + k' 2 — 2kk'^, where fi = k ■ k' , and 
then integrating over fi. At this point, we can use the exact 
integral: 



dfj,jo(Ak x) jo(Ak x) 



(50) 



2 k k' xx 
-Ci(Afc_Aa; 



7 [Ci(Afc+Aa;_) + Ci(Afc_Ax+) 
Ci(Afc+A:r + )] , 



where Ak+ = k + k' , Ak- = \k — k'\, etc., and Ci(z) = 
— /°° dxcos(x)/x is the cosine integral function. An alter- 
native expression for this integral can be obtained by em- 
ploying Rayleigh's expansion in Eq. ( |41[ ), expanding each 
one of the four phases in exp[ifc • (x — x') + ik' ■ (x" — x)] into 
spherical waves, and then integrating over all the angles in 
position space and in Fourier space. The final result can be 
recast in terms of the isotropic phase space window function 
W in two ways: 

1 1 

2-7T k k' x x' 
x [Ci(Afc+Ax_) + Ci(Afc_Ax + ) 

-Ci(Afe_Aa;_) - Ci(Afc + Aa; + )] 
= -£(2£+l) (51) 

t 

x je(kx) je(kx') je(k'x) je(k'x) . 
With the help of the asymptotic expansion of the cosine 



W(k,x;k',x') = 



integral function Ci(z) = 7 + 



+ 0(z 4 ), where 7 



is the Euler gamma constant, one can verify from the first 
expression that the window function is regular everywhere, 
including the limits Afc_ — > and Ai_ — > 0. From the 



expression in the second line of Eq. (51 1 one can verify that 



the window function is normalized upon integration over 
the two-dimensional phase space (k,x), by making use of 
the identities: 



r°° 2 

/ dzz j e (az)j e (bz) 
Jo 

J2(2e + i)j 2 (z) 



— a 6 D (a — b) 



(52) 
(53) 

(54) 



which then lead immediately to: 

J dkk 2 J dx x 2 W(fc, x; k' , x') — 1. 

So, it is clear from this expression that the classical limit, in 
the isotropic case, can be reached by taking W(fc, x; k' , x') — > 
k- 2 S D (k-k')x- 2 S D (x~x'). 

In terms of the phase space window function, the Fisher 
matrices can be written, for the power spectrum, as: 

F P (k,k') = ^{2n) 3 J dxx 2 j dx x' 2 (55) 

x W{k, x; k', x) U(k, x) U(k', x') , 
and for the effective number density as: 

F N (x,x') = ^(2ny 3 J dkk 2 J dk' k' 2 W(k,x;k',x') 

x {2U(k, x) U{k', x) + T(k, x) [1 - U(k', x')] 
+ [l-U(k,x)]T(k',x')} . (56) 

If we use the formula for the phase space window func- 
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tion in terms of the cosine integrals, then the radial inte- 
grations over x and x' , or over fc and fc' , are not separable 



anymore, as was the case in Eqs. |41l-|42l. However, by 



writing the phase space window function in terms of the in- 
finite sum over spherical Bessel functions we can perform 
the integrations separately. For the power spectrum Fisher 
matrix we have: 



F P (k,k') = i(2^) 3 ^(2£+l) 



(57) 



x J dx x 2 je(kx) je(k' x) U(k, x) 
x j dx 1 x' 2 ji(kx') je(k'x') U(k',x') 



The two integrals are exactly the same, so for a generic sur- 
vey all that is needed is to compute the Hankel transforms 
of the phase space weighting function U: 

U e (k;k') = J dxx 2 j l {kx)j l {k'x)U{k,x) . (58) 

The Fisher matrix is then given by the sum: 

F P (k,k') = ^(27t) 3 ^(2^+l)C/"(fc;fc')C/ £ (fc';fc) . (59) 

I 

In the following Subsection I will show that, for the sim- 
ple case of a top-hat density profile, we can employ the dual 
expressions of the phase space window function contained 



in Eq. (51 1 in order to obtain an analytical solution for the 



Fisher matrix. 



4.1 Analytical solution: top-hat profile 

Now I will show how we can obtain an analytical formula for 
the Fisher matrix Fp in the trivial case of a uniform effective 
density with a top-hat profile, i.e., N(x) — Nq9(xq — x), 
where 9(x) is the Heaviside step-function. With a top- hat 
density profile Eq. (57 1 becomes: 



F%{k,k') = 8n 2 U (k)U (k') ^(2£+l) 



dx x 2 je (kx) jt(k' x) 



(60) 



x / dx x' 2 je(kx') je(k' x) , 



where U (k) = N P(k)/[l + N P(k)]. The definite integrals 
above are straightforward, using the identity: 



dz z ji(az)ji(bz) 



1 



(61) 



[bjt-i(bz)jt(az) - ajt-i(az)j t (bz)] 
,,i _ h 2 [ b fe(bz)je(az) - aj' e {az) j e (bz)] , 



j 2 -6 2 
1 



where, from the second to the third line, I used 
the recursion relations for the Bessel functions, 
j' e (z) = je-i(z) — (£ — l)je(z) / z. Using this formula for 



the integrals over x and x' we obtain that: 



F P (k,k') = 8tt 2 U {k)U (k') 



(fc 2 - k i2 y 



x [kx j'e(kx ) ji(k'x Q ) - k' x j' e (k' x ) ji>(kx )] 2 
= 8-K 2 U (k)U (k')x 6 W°(k,k') . (62) 

The Fisher matrix above applies only to the self-correlations 
of a single species of tracer, and measures the covariance of 
the auto-correlation spectrum. For that reason I have called 
the window function above, W®, the self-correlation window 
function. By taking derivatives of the exact solution for the 
full phase space window function, Eq. ( |51[ ), it is possible to 
express the self-correlation window function in terms of the 
full isotropic window function: 



W°(k,k') 



(k 2 - fc' 2 ) 2 



£(2< + l) 



(63) 



[kx j' e (kx ) ji(k'x ) - k'x j' e (k'x )j e (kx ) 
1 7T / d d 



(k 2 



Xq 2 \d log x d log x' 







8 



1 log k d log k' 



W(k, x; fc', x'] 



Substituting the expression for W in the first line of Eq. ( 51 1 
into the last expression above we find: 



W°(k,k') 



1 



(64) 



16 fcfc'(fc 2 - fc' 2 ) 4 ^ 
x {8kk' [fc 2 + fc' 2 + (fc 2 - fc' 2 ) 2 ^] 
+ (fc-fc') 4 cos[2(fc + fc')a;o] 
-(fc + fc') 4 cos[2(fc-fc')a;o] 
+ 2(fc + fc')(fc - k'fx a sin[2(fc + k')x ] 
- 2(fc - fc')(fc + k') 4 x sin[2(fc - k')x ]} . 

A slightly condensed expression can be found using the fact 
that z 2 j-2(z) = cos z + zsinz, which leads to: 



W°(k,k') 



16 fcfc'Afc* Aktx% 



(65) 



x [8fcfc' (fc 2 + fc' 2 + Afc 2 Afc 2 xl) 
+ Ak 4 _ (2Ak+x ) 2 j. 2 (2Ak + x ) 
- Afc+ (2Afc_a:o) 2 j_ 2 (2Afc_x )] . 

In Fig. |4.1| this window function is plotted for some values 
of fc'. 

The self-correlation window function W° is positive ev- 
erywhere, and is highly peaked around fc = fc' when kxo 3> 1. 
However, in contrast to a delta-function, it only has support 
on a finite volume, and therefore it has the properties that 
both its width and its maximum height remain finite in the 
limit fc' — > fc: 



lim W°(k,k') = 

k'— >k 



(cos kxo + 4fca;o sin4fca;o 



-1 - 8fc 2 a;o + 32fc 4 a; 4 ) 
+0(k-k') . 



(66) 



In fact, when kxo 3> 1 the window function can be written 
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Figure 1. Window function k 2 x^ W%(k, k'), with k'xo = 0.1, 
1, 5, 10, 20, 30 and 40 (from left to right). For k'x < 1 the 
window function is essentially independent of k' for kxo <C 1. 
Notice that even for large k' the window function does not become 
more localized around k = k' - the width of the window function 
around the peak is always limited by the size of the survey, x^ 1 . 



lim lim W°(k,k') 

k' — yk kxQ — >oc 



+0{k-k' 



1 1 

8 ~ 36 

3 



(k — k') 2 XQ 



(67) 



This expression shows that even for arbitrarily small scales 
(kxo 1), the finite size of the survey limits the size of the 
volume in Fourier space inside which we can define a band- 
power that is linearly independent from the other bandpow- 
ers. The minimal width of bandpowers in the small-scale 
limit is, from the formula above, Afc m i n ~ 3/y2xg . 

One can also take the joint limits k — > k' and k — > 0, 
which then result in: 



lim lim W°(k,k') = i 

k' — yk kx — >0 9 



2,2 2 
-k X 



+ o(r 



(68) 



This limit shows why the classical approximation is inac- 
curate at large scales (small k): in that regime, the phase 
space window function is in fact independent of kl - i.e., on 
large scales the Fisher matrix is essentially an average over 
the phase space cells close to the origin. This result means 
that the first fc-bin of a survey has to include all the modes 
< k < ^l/Sxg 1 . This is, of course, a manifestation of 
cosmic variance, which tells us that no survey can measure 
structure on scales larger than the size of the survey itself. 

The preceding discussion implies that the optimal sizes 
of the bins both in the large-scale and in the small-scale 
regimes are always commensurate with the only other scale 
in the problem, Xq 1 . The only exception to this rule would 
be a spectrum P(k) which has a very sharp and well-defined 
feature at some particular scale, such that the spectrum it- 
self changes more rapidly than the window function near 
that scale. 

Hence, to summarize the results of this Section, we have 
found that the Fisher matrix for the power spectrum in the 
case of a survey with a top-hat number density is given by: 

F P (k,k') = ^-U (k)U (k')-x 6 W°(k,k') , (69) 

where U (k) = N P(k)/[l + N P(k)] and W s is given by Eq. 
|64|. 



Now, I will show that Eq. (691 is basically identical 



to the FKP Fisher matrix that was found, with a slightly 



different approach, by Hamilton [Hamilton ( 1997a |]. From 
that reference, considering only the lowest-order term in the 
series for the inverse of the covariance matrix, we get that: 



F 



(FKP) 



(*,*') 



d 2 k d 2 k' 
47t 4n 



U(ie + k') 



(70) 



where: 



U(k) = / <fxe w - x U{k,x) = 



ixc ik. s N(x)P(k) 
X£ I + N(x)P(k) ' 
(71) 

In the expression above, k corresponds to a "trial" wavenum- 
ber that should be chosen a posteriori in order to maximize 
the Fisher matrix (and minimize the covariance) for the 
bandpower that is being estimated. The scale k is in fact 
inherited from the inversion of the covariance matrix, un- 
der the approximation that it is diagonal. It is not entirely 
clear what sets the correct choice of k, but it has been com - 
mon practice to take k — > (k + k')/2 Hamilton ( 1997a|b I ; 



Tegmark|fi997| )]. 

Under the assumption of isotropy and with a top-hat 
effective number density N(x) — No6(xo —x), it is trivial to 
compute U in the classical limit: 



U(k;k) 



4nUo(k)k [sin kxo — kxo cos kxo] 
47vU {k)k- 3 (kxofMkxo) . 



(72) 



Substituting this expression into Eq. ( 70 1 we find that the 



integral is exact, and the result is in fact: 

\3 



F 



f KP) (M') = ^ U 2 ik) 2 xt W°(k,k') . (73) 



Now compare Eqs. (69 1 and (73 1: the phase space window 



function is precisely the same in the two expressions, and the 
only difference is that the latter equation takes k = k' = k 
in Uq- This is a good approximation only if the k bins are 
sufficiently large, in which case the binned window function 
is very nearly diagonal. 



4.2 FKP formulas: the stationary phase limit 

Now, let's compare the analytical result of the previous sec- 
tion with the corresponding FKP formulas (which are in the 
stationary phase limit). Because of the Dirac delta function 
in the FKP Fisher matrix, it is more convenient to compare 
the averages over bins fc;: 

d 3 k' 



F, 



P,ii — 



d 3 k 
(>r)3 



F P (k,k') 



(74) 



Vi 
(2tt) 3 
1 

2(2tt) 



Fp(k — ki,k' = kj) 



(2tt) 3 
8 -l7o(*i)E/b(fcj 



ViVj-xtW^ki 

7T 



where V, = 4irk 2 Aki is the volume of the shell in Fourier 
space around the i-th bin, and in the last expression I as- 
sumed that the binning is small enough that Fp does not 
vary too much inside the bin. 

The FKP Fisher matrix in the classical limit can be 



obtained directly from Eq. ( 55 \ by taking the SP approxi- 
mation, W(fe, x; k', x') — > k~ So{k — k') x~ 2 5d(x — x 



For 
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Figure 3. Rows i of the matrix (kiXo) 2 V^ shown in Fig. pfor 
the bins kx = 0.5, 5, 10, 15, 20 and 25 (j = 1, 6, 11, Id . . . 
51). Since Vi is proportional do fc 2 , I plot the volume times an 
adimensional pre-factor of (kiXo)~ 2 - Above kxo ~ 10 the rows 
of the volume matrix are essentially self-similar after normalizing 
for the pre-factor of (kxo)~ 2 . 



Figure 2. Self-correlation phase space volume matrix Vy with 
100 equally-spaced bins between kxo = and kxg = 50. 



our top-hat profile, the FKP Fisher matrix for the power 
spectrum in the classical limit is: 



lim F P ,ij 

class 



(2-) 3 



U (h) Si 



(2tt) 6 



(75) 



where Vo = 47rx|)/3 is the total volume of the survey (in 
position space, naturally). 

It is possible to compare the full expression for the 
Fisher matrix with its classical limit, in a way which is com- 
pletely independent of the phase space weighting function 
Uo - and, therefore, in a way that does not depend on ei- 
ther the effective density iVo or the fiducial power spectrum 
P(k). In fact, all we need to do is to compare the adimen- 
sional matrices associated with the phase space volume: 



y cla 3S = y, y o , 



v. V° j =V i V j -xgW?(fc<,*i), 

(76) 

In Fig. [2] I plot the self-correlation phase space vol- 
ume matrix Vy 3 binned in 100 equally spaced intervals of 
Afc = O.Sxg 1 between fc = and kxo = 50. In this 2D rep- 
resentation of the phase space volume matrix, darker colors 
denote higher values of phase space volume. Obviously, the 
classical counterpart of this matrix is the diagonal matrix 
Vfj aaa . In Fig. [3] I plot some of the rows of the volume ma- 
trix, to show how they are spread out over the k bins. In the 
classical limit, each curve would be a Dirac delta-function 
centered on k = k' . 

In Fig. [4] I compare the entries of the full phase space 
volume, Vij , with the normalization provided by the classical 
(FKP) approximation, Vj C j ass . The upper set of points (blue 
in color version) denote the diagonal elements of the phase 
space volume in the classical approximation. The lower set of 
points (yellow in color version) denote the diagonal elements 
of the full phase space volume matrix, Vu. Also plotted are 
the traces of the rows of the volume matrix, Vi = • Vij 
(middle set of points, red in color version). Since the phase 
space window function W is in fact normalized to unity over 



the whole volume of phase space, these traces should be 
equal to their classical limit counterparts. Fig.|4]shows that, 
indeed, the normalization of the full phase space volume ma- 
trix is very well approximated by the classical approximation 
on all but the largest scales - the difference is essentially due 
to the finite size of the bins. The fall-off seen on very small 
scales (the highest values of k) is just an artifact of cutting 
off the bins at the edge (kxo = 50 in our example). 

Inspection of Figs. |2|.'i| shows that, by increasing the 
size of the k bins, the full Fisher matrix can be well ap- 
proximated by the classical limit expression. Under these 
conditions the bandpowers are then approximately uncorre- 
cted - although we should always keep in mind that coarse- 
graining the Fisher matrix leads to loss of information, and 
that this lack of correlation only applies for bins of order at 
least Afc ~ 52(7 in our example of a top-hat effective den- 
sity. To put that into perspective, for a uniform Hubble-size 
(cHq 1 ~ /t -1 2997 Mpc) galaxy survey the bins would need 
to be only as large as Afc ~ 3.10 -4 h Mpc -1 for this approx- 
imation to be applicable, and for a survey spanning a tenth 
of a Hubble volume the k bins would need to be greater than 
Afc ~ 3.10 -3 h Mpc -1 . As a concrete example, consider the 
analysis of baryon acoustic oscillations on the SDSS-7 per- 
formed in Percival et al. (20101: if the volumetric galaxy 



density of that dataset were homogeneous and isotropic, the 
minimal size of the bins such that the bandpowers are ap- 
proximately uncorrelated should be of order Afc ~ 0.007 h 
Mpc - . However, in Figs. 1 and 3 of that paper the bins are 
spaced only by Afc ~ 0.004 h Mpc -1 , which means that the 
datapoints shown in those figures are highly correlated. For 



their statistical analysis, Percival et al. (20101 fitted cubic 
splines on nodes separated by Afc ~ 0.05 h Mpc -1 , which 
gives an effective bin size of approximately a quarter of that 
separation, i.e., Afc e // ~ 0.0125 h Mpc -1 , which is close to 
the limit I computed above assuming a top-hat density pro- 
file of galaxies. Using a more realistic distribution of galax- 
ies as a function of redshift and angular position in the sky 
would only make this problem worse. 

As mentioned above, one should keep in mind that by 
increasing the size of the bin we lose some amount of in- 
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Figure 4. Diagonal elements of the full phase space volume ma- 
trix (lower points, yellow in color version), compared to the same 
volume in the classical (FKP) approximation (upper points, blue 
in color version) . I also show the traces of each individual line of 
the phase space volume matrix (middle points, red in color ver- 
sion), which are very well approximated by the diagonal elements 
of the volume matrix in the classical approximation, simply be- 
cause the phase space window function is normalized (the end 
points are off due to the edges of those bins having been cut-off). 
I have employed 100 bins between kxo = and kxo = 50. 



formation by washing out the Fisher matrix. On the other 
hand, for practical and numerical purposes it is inefficient 
to keep an excessively large number of bins. In that sense, 
it is interesting to examine how many linearly independent 
modes are encoded in the Fisher matrix, by making a prin- 
cipal component analysis (PC A) on it. The result of the 
PCA decomposition fn our Fisher matrix with 100 equally 
spaced bins is presented in Fig. [5] where the principal val- 
ues are plotted as a function of k for the 100 bins. Fig. [6] 
shows the eigenvalues corresponding to the principal values. 
From Fig. [6] it can be seen that only about 35 components 
are relevant for this Fisher matrix, and from Fig. [5] we see 
that the highest-ranked ones probe the small scales (large k), 
whereas the lowest-ranked amongst the 35 non-trivial prin- 
cipal components span the large scales (small k). The 65 
lowest-ranking principal values have negligible eigenvalues, 
and it is clear that they carry no information whatsoever. 
This is an indication that the optimal average size of the 
bins in the case of a top- hat survey should be at most of the 
order of Afc ~ 50x Q ~ 1 /35 ~ IAxq 1 . However, in that case 
one should be careful to include the cross-correlations be- 
tween the different bandpowers, as not doing so would lead 
to an overestimation of the constraints. 



4.3 Analytical solution for two species of tracers 
with top-hat density profiles, including the 
cross-correlations 

Now I will generalize the results of the previous section to 
the case where we have two species of tracers. The main 
distinction with the previous sections is that now the total 
number density is the sum of two different top-hats, N(x) = 
Ni(x) + N 2 (x), where AT M (x) = iV M 0(z M - x). Here I will 
assume that the survey of species 1 is dense but shallow, 
and the survey of species 2 is sparse but deep, so Ni > N2 
and Xi < X2 - This would be the case, e.g., of a homogeneous 
survey of luminous red galaxies limited to z < 1, and a 
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Figure 5. Principal components of the Fisher matrix, binned in 
100 equally spaced intervals between kxo = an d kxo = 50. The 
horizontal axis corresponds to each principal component, ranked 
by their eigenvalues, and the vertical axis (from top to bottom) 
corresponds to the k bins. 
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Figure 6. Eigenvalues of the principal components of the Fisher 
matrix. It is clear from both Fig. [5] and [6] that there are only 
about 35 principal values with non- negligible eigenvalues, which 
means that only as many linearly independent bandpowers can 
be estimated from the data. The plots also show that, among the 
non- negligible principal components, the highest-ranked correlate 
with large values of k, and the lowest-ranked involve essentially 
the small-A; bins. 



survey of quasars or Ly-a absorption systems limited to z < 
3. 

With this in mind, we can write the phase space wcight- 
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ing function Uj of the previous Section as: 



x) + NiQ{x2 

-0(X! - x) , 



■x)]P(k) 



(77) 



1 + [Nrfixi 
NiP(k) 

i + N 12 p(ky 

where N± 2 = Ni+N 2 is the total effective number density for 
< x < x\. For the tracer species 2 the weighting function 
is expressed as: 

n 2 o(x 2 - x)P(k) 

1 + [Ni9{x! -x) + N 2 6(x 2 - x)]P{k) 
N 2 P{k) N 2 P(k) 



U 2 (k,x) = 



(78) 



+ 



1 + N 12 P(k) 
N 2 P{k) 



1 + N 2 P(k) 



(Xl 



x 2 



1 + N 2 P(k) 

Clearly, then, we can redefine the two effective densities so 
that they reflect the two distinct top-hats. Collecting the 
amplitudes of each individual top-hat profile, we obtain: 

N 12 P(k) N 2 P(k) 



U[{k,x) = 



U 2 {k,x) = 



f + N 12 P(k) 
U{(k)0(xi - a 

N 2 P(k) 
1 + N 2 P{k) 
U^{k)6(x 2 ~ 



1 + N 2 P(k) 



(x 2 - x) 



9{x! - a;) (79) 



(80) 



Therefore, when computing the total Fisher matrix 
for two species of tracers one should include the two self- 
correlation Fisher matrices, as discussed in the previous sec- 
tion, using either U[(k) for the self-correlation of species 1, 
or U 2 (k) for the self-correlation of species 2. In addition, we 
must also include the Fisher matrix for the cross-correlation 



between the two effective top-hats of Eqs. (79H8CM, which 
I discuss now. 

The Fisher matrix for the cross-spectrum arises from 
the cross-correlation between two species of tracers of large- 
scale structure. From Eq. ( |57[ l, and according to the discus- 
sion of Section [2] the Fisher matrix for the cross-correlation 
between the top-hat of radius x\ and the top-hat density of 
radius x 2 is given by: 



F P 2 (k,k') 



8tt 2 U{(k)U 2 (k') ^(2^ + 1) 

e 

dx x 2 je(kx) ji(k'x) 
dx' x' 2 ji(kx') je(k'x') 



(81) 



Using the same derivation that was used to arrive at Eq. 
162]), I get: 

I 



He (M') 



(k 2 - k' 2 ) 2 x 2 xl 

x£)(2* + l) S l(k,fc / )fl?(*,fc') , 



(82) 



where: 

g\{k,k') = kxi j'i(kxi) j e (k 'xi) - k' Xi j' e (k' x t ) je(kxi). (83) 

Since g}(k, k') is of order (D(k — k') in the limit k' — > k, 
the cross window function is well-behaved everywhere. The 
most important distinction between the self-correlation win- 
dow function W s and the cross-correlation one, W c , is that 




kxi 



Figure 7. Cross-correlation phase space window function 
k 2 x\ VK c 12 (fc, k'), with x 2 = 2x\. The modes plotted are, from left 
to right, k'xi = 1, 2, 5, 10, and 15. As was the case for the self- 
correlation window function, for k'x\ <S 1 the window function is 
essentially independent of k' . The most important difference be- 
tween the two window functions is that the cross window function 
can become negative - notice that I have plotted the log of the 
absolute value of k 2 W c , but the positive values are always found 
at the peak of the window function, and the spikes to negative 
infinity mark the transition from positive to negative values or 
vice-versa. The width of the window function around the peak is 
given by the inverse of the largest length scale of the survey (in 
this example, x 2 ). 

the former is always positive, whereas the latter is positive 
at its peak (at k' = k), but presents damped oscillations be- 
tween positive and negative values away from the peak. An- 
other important difference between the self-correlation and 
the cross-correlation window functions is that I was able to 
find an analytical expression for the former, but not for the 



latter, which is left in the form of the infinite sum, Eq. ( 82 1. 
The situation is not as dire as it may seem, since the spher- 
ical Bessel functions je(z) are highly peaked around z ~ £, 
and Limber-type approximations allow us to cut off the in- 
finite sum to a small number of terms with minimal loss of 
precision. In Fig. [7] I plot some of the modes of the cross 
window function in the case x 2 = 2x\. 

The full Fisher matrix for the power spectrum in the 
case of two top-hat profiles is therefore given by the combi- 
nation of the self-correlation and the cross-correlation terms 
corresponding to the two top-hats above, with amplitudes 
U[(k) and U 2 (k). The explicit expression is: 

F P (k,k') = 8tt 2 {xfU[(k)U[(k')Ws(k,k') 
+xtu! 2 {k)U 2 {k')W 2 {k,k') 
+ x\x A 2 [#(*)$(*') W e 12 (*,*') 
+U' 2 {k)U' 1 (k')W 2 c \k,k')\} , 

and the binned Fisher matrix is, therefore: 



(84) 



F P . 



in 



+ uL u' 2 



2 : "2, ] K, 



(85) 



2(2tt) 3 

+ [UliU^ + U^Uij] V?} , 

where V c is defined in terms of the cross-correlation window 
function W c in the same way as V s was defined in terms of 
W s in Eq. |76l: 



7T 



(86) 



The SP approximation to the full Fisher matrix can 
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be reached directly from Eq. (551 by taking the limit 
W(k,x;k',x') -> k~ 2 8 D {k - k')x^5 D (x 
integrating over the volumes of the k bins: 



x'), and then 



F 



1 



P,ij 



2(2tt) 3 

"47T 



Vi 5ij 



(87) 



+ 



4tt 



x 3 ,) Ui 



where U X2 ,i = N 12 P(h)/[l + N 12 P(ki)] =U{ + U' 2 . The rel- 
evant difference with respect to the analytical expression in 
the classical limit is that, because of the SP approximation, 
all the signal from the cross-correlation between f and 2 is 
already implicitly included in the amplitudes U\ 2 and U 2 . 
This becomes clearer if we rewrite the SP limit of the full 
Fisher matrix as: 



F; 



P,ij 



2(27r) 8 ^* y 3 

x [x\ U[ 2 :i + xlU' 2 % + x\ (U? 2ti - U'l 
2(2tt)3 Vi ° ij 3 

x [a,? U&+ xlU? :i + 2x 3 1 U[ ti U^] 



u: 



Compare now this last expression with Eq. ( 85 1. The self- 
correlation term for species 1 has already been analyzed 
in the previous sections, and the self-correlation term for 
species 2 is precisely the same, except for the scaling 
k — > k x x\/x 2 . The comparable cr oss- correlation terms are 



ViViSij and Vy, as defined in Eq. (|86l 



I show the cross-correlation phase space volume in Figs. 
[8] and [9] On Fig.§J the rows of the volume matrix are plot- 
ted for the first few k bins. As discussed above, the phase 
space window function (and, therefore, the volume matrix) 
can be negative for the cross-correlation term. Comparing 
Figs.[3]and[9] we see that the cross-correlation volume matrix 
is narrower around the diagonal. However, this is a simple 
consequence of the inclusion of the second species of tracer, 
whose top-hat profile has a radius x 2 = 2x 1 . Naturally, the 
width of the cross-correlation volume matrix in k bins is 
dominated by the inverse of the largest scale, which in this 
example is Afc ~ x^ 1 = O^x^ 1 . 

In Fig. [10] I compare the phase space volume with its 
SP approximation, for both the self-correlation terms and 
for the cross-correlation as well. In order to compare the 
volume matrices on an equal footing, I have normalized the 
volume of species 2 to the volume of species 1. In the up- 
per panel I have plotted ViVi, as well as the traces of each 
row of the volume matrices, > Vi/V 2 x Vi} an d 

x ^ . Vfj . From the upper panel we can verify that there is 
good agreement between the SP approximation and the an- 
alytical result in the classical approximation at intermediate 
scales, where the analytical result is only ~ 2-3% below the 
SP approximation. This means that, by taking large enough 
fc-bins one can recover the FKP result for the Fisher matrix 
from the result in the classical approximation. Only at the 
very largest scales (the single bin between < kx\ < 0.5) 
the SP approximation fails, and the FKP Fisher matrix un- 
derstates the constraining power of the survey. 

The lower panel of Fig. [fOjshows the ratios of the diago- 
nals of the phase space volume matrices to the traces of their 
respective rows - which is a measure of how diagonal those 
matrices are. Tracer species I, which occupies the smallest 
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Figure 8. Cross-correlation phase space volume matrix Vy , with 
100 equally-spaced bins between kxi = and kxi = 50, and 
assuming x 2 = 2xi. 
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Figure 9. Rows of fc? Vf- for the bins kx 1 = 0.5, 5, 10, 15, 20 
and 25 (from left to right, respectively). 



volume, is the least diagonal, and tracer species 2, which 
spans length scales roughly double those of species 1, has 
a more diagonal Fisher matrix, by a factor of two, approxi- 
mately. The cross-correlation Fisher matrix, in fact, appears 
to be the most diagonal of the Fisher matrices. However, 
that is partly an artifact coming from the wings of the cross- 
correlation window function, which are negative - see Fig2. 

) 9 When we account for this (by, e.g., using the squares 
of the window functions as the normalization), the cross- 
correlation Fisher matrix comes out to be approximately as 
diagonal as the self-correlation Fisher matrix of the tracer 
species with the largest volume - in our case, species 2. 

It is also useful to perform a PCA analysis on the cross- 
correlation phase space volume, as was done for the self- 
correlation volume in the previous section. On Fig.[TTJl show 
the principal components of V c as a function of the k bins. 
On Fig. [12] I show the eigenvalues of the principal values 
for the self-correlations of species 1 and 2, as well as the 
cross-correlation between the two species. 

The results of this Section can be easily generalized 
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Figure 11. Principal components of the cross-correlation Fisher 
matrix for the power spectrum, binned in 100 equally spaced in- 
tervals between kxg = and kxg = 50. The horizontal axis corre- 
sponds to each principal component, ranked by their eigenvalues, 
and the vertical axis (from top to bottom) corresponds to the k 
bins. 



Figure 10. Upper panel: the horizontal row of points are the 
classical and SP approximations to the phase space volume ma- 
trix for the cross-correlation between tracer species 1 and 2. The 
lower and upper solid lines (blue and green in color version) corre- 
spond to the trace of the rows of the self-correlation phase space 
matrices of species 1 and 2, respectively - i.e., yv V^ 1 ' 2 ^ . The 
dashed line (orange in color version) corresponds to the trace 
of the cross-correlation phase space matrix. Lower panel: ratios 
of the diagonal elements of the phase space volume matrices to 
their traces, Vu/Y2j Vy- From the lower to the upper lines, self- 
correlation of species 1 (lower solid line, blue in color version), 
self-correlation of species 2 (middle solid line, green in color ver- 
sion), and cross-correlation V£ (upper dashed line, orange in color 
version). I employ 100 bins between kx\ = and kx.\ = 50. 



to an arbitrary number of species of tracers, and to any 
(isotropic) number density - not only uniform densities. 



5 DISCUSSION 

In this paper I have shown how to compute the Fisher matrix 
for galaxy surveys, including the cross-correlations between 
different cells in position space and in Fourier space. In the 
stationary phase approximation these cross-correlations are 
discarded - Hamilton [Hamilto"n ( 1997a|c |] refers to this case 
as the "classical limit". 

I have also shown how to obtain the Fisher matrix for 
multiple species of tracers of large-scale structure from the 
covariance of counts of galaxies in cells. The final formulas, 
after taking the classical and SP limits, generalizes the re- 



sults previously obtained by Percival et al. (20031; White 
et al.| ( |2008| ); |McDonald fc Seljakj ( |2008| ). However, I have 
also obtained the Fisher matrix using only the classical limit, 
which solves some (probably minor) inconsistencies of those 
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Figure 12. Eigenvalues of the principal components of the Fisher 
matrices for the self-correlation of tracer species 1 with itself (left 
line and points, blue in color version), for the self-correlation of 
species 2 with itself (right line, green in color version), and for the 
cross-correlation between the two species (middle line, orange in 
color version). 



formulas. I have also shown that, in order to invert the co- 
variance matrix for the full dataset with all species of trac- 
ers, all that is needed is the inversion of a single matrix (or 
operator), C, and not the inversion of a large set of linear 
equations - see Eq. (21 1 and the following discussion. The 
main results are summarized by Eqs. (40l-(45l, and their 
classical limits are shown in Eqs. ( |46| l-(|48|. 

The full Fisher matrix in the classical approximation 



can be expressed entirely in terms of the phase space weight- 
ing functions U^(k,x) = Nf,(x) P(k)/[l + N(x) P(k)], where 
Nfj.(x) = n^B 2 ^ is the effective density of the tracer species 
[i. These weighting functions are basically the FKP pair- 
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weights. These results make the case that, just as P(k) is the 
density of modes in Fourier space, and N(x) = -^m plays 
the role of the total effective density of tracers in position 
space, the quantity |C/ 2 (fc, x) can be interpreted as the den- 
sity of information in phase space. The Fisher information 
matrix for the power spectrum is simply the sum (or trace) 
of the information over position-space volume [i.e., the effec- 
tive volume V e ff(k)], and the Fisher matrix for the bias is 
the sum (or trace) of the information over the Fourier-space 
volume [what I have called here V e ff{x)\. The elements of 
the Fisher matrix which mix the power spectrum estimation 
at k with the bias estimation at x are given simply in terms 
of |f/ 2 (the density of information), times the phase space 
volume occupied by the bins at k and at x. In a forthcoming 
paper (Abramo 2011, to appear) I will show how to use this 
result to jointly estimate the power spectrum and the bias 
(together with the matter growth function) from the same 
dataset, without introducing hidden priors; and conversely, 
how to properly include priors in these estimations. 
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